首页> 外文OA文献 >Automatic ontology-based knowledge extraction from web documents
【2h】

Automatic ontology-based knowledge extraction from web documents

机译:从Web文档中自动提取基于本体的知识

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

To bring the Semantic Web to life and provide advanced knowledge services, we need efficient ways to access and extract knowledge from Web documents. Although Web page annotations could facilitate such knowledge gathering, annotations are rare and will probably never be rich or detailed enough to cover all the knowledge these documents contain. Manual annotation is impractical and unscalable, and automatic annotation tools remain largely undeveloped.Specialized knowledge services therefore require tools that can search and extract specific knowledge directly from unstructured text on the Web, guided by an ontology that details what type of knowledge to harvest. An ontology uses concepts and relations to classify domain knowledge. Other researchers have used ontologies to support knowledge extraction,1,2 but few have explored their full potential in this domain.\udThe Artequakt project links a knowledge-extraction tool with an ontology to achieve continuous knowledge support and guide information extraction. The extraction tool searches online documents and extracts knowledge that matches the given classification structure. It provides this knowledge in a machine-readable format that will be automatically maintained in a knowledge base (KB). Users could further enhance knowledge extraction using a lexicon-based term expansion mechanism that provides extended ontology terminology.
机译:为了使语义网栩栩如生并提供高级知识服务,我们需要有效的方法来访问Web文档并从中提取知识。尽管网页注释可以促进此类知识的收集,但是注释很少见,并且可能永远不会足够丰富或详细以覆盖这些文档包含的所有知识。手动注释是不切实际且不可扩展的,并且自动注释工具在很大程度上还没有开发出来,因此专业知识服务需要能够在网络上直接从非结构化文本中搜索和提取特定知识的工具,并以本体为指导,该本体详细描述了要收集的知识类型。本体使用概念和关系对领域知识进行分类。其他研究人员已经使用本体来支持知识提取,1,2,但是很少有人探索其在这一领域的全部潜力。\ udArtequakt项目将知识提取工具与本体联系在一起,以实现持续的知识支持和指导信息提取。提取工具搜索在线文档并提取与给定分类结构匹配的知识。它以机器可读的格式提供此知识,该格式将自动保存在知识库(KB)中。用户可以使用提供扩展的本体术语的基于词典的术语扩展机制来进一步增强知识提取。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号